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Abstract — In Cognitive Radio Networks (CRNs), secondary 
users (SUs) are allowed to opportunistically access the 
unused/under-utilized channels of primary users (PUs). To utilize 
spectrum resources efficiently, an auction scheme is often applied 
where an operator serves as an auctioneer and accepts spectrum 
requests from SUs. Most existing works on spectrum auctions 
assume that the operator has perfect knowledge of PU activities. 
In practice, however, it is more likely that the operator only 
has statistical information of the PU traffic when it is trading a 
spectrum hole, and it is acquiring more accurate information 
in real time. In this paper, we distinguish PU channels that 
are under the control of the operator, where accurate channel 
states are revealed in real-time, and channels that the operator 
acquires from PUs out of its control, where a sense-before- 
use paradigm has to be followed. Considering both spectrum 
uncertainty and sensing inaccuracy, we study the social welfare 
maximization problem for serving SUs with various levels of delay 
tolerance. We first model the problem as a finite horizon Markov 
decision process when the operator knows all spectrum requests 
in advance, and propose an optimal dynamic programming based 
algorithm. We then investigate the case when spectrum requests 
are submitted online, and propose a greedy algorithm that is 1/2- 
competitive for homogeneous channels and is comparable to the 
offline algorithm for more general settings. We further show that 
the online algorithm together with a payment scheme achieves 
incentive compatibility for the SUs while guaranteeing a non- 
negative revenue for the operator. 

I. Introduction 

With the ever-growing demand for wireless spectrum, Cog- 
nitive Radio Networks (CRNs) have been proposed to better 
utilize spectrum holes in wireless networks. In CRNs, sec- 
ondary users (SUs) are allowed to opportunistically access the 
channels of primary users (PUs). To utilize spectrum resources 
efficiently, an auction framework is often applied where an op- 
erator serves as an auctioneer and accepts requests from SUs. 
These frameworks are implemented via a resource allocation 
and a payment scheme with the objective of maximizing either 
social welfare or revenue Q, |Hl-||6l, lfT4l . 

Most existing works on spectrum auctions, however, assume 
that the operator has perfect knowledge of PU activities 
in a given period of time. They ignore the uncertainty of 
channel states caused by the uncertain and frequent PU usage. 



Hence, these existing auction schemes are mainly applicable 
to spectrum resources that tend to be available for relatively 
long periods of time. For instance, the interval between two 
adjacent auctions is assumed to be 30 minutes or longer 
in (4). However, to allow more efficient spectrum utilization 
and relieve spectrum congestion, spectrum holes at smaller 
time scales need to be explored. A straightforward extension 
of current approaches to this more dynamic environment 
would require auctions to be conducted frequently, which 
would incur high communication and management overhead. 
A more reasonable approach is to again consider a relatively 
long period of time, where the operator only has statistical 
information of the PU traffic when trading spectrum holes. 
More accurate information is acquired later in real-time. 
Therefore, an auction scheme that takes spectrum uncertainty 
into account is needed. 

To further improve spectrum utilization, besides trading 
spectrum holes that are fully under the control of the operator, 
as commonly assumed in the spectrum auction literature, 
the operator may choose to acquire licensed channels out 
of its control to further improve social welfare or revenue. 
To avoid interference with PUs, a sense -before -use paradigm 
must be followed in this case. The operator must first identify 
spectrum holes in a channel, e.g., by coordinating SUs to 
sense the channel, before allocating the holes to SUs. While 
spectrum sensing has been extensively studied in the CRN 
literature BD- llTOl . the joint problem of sensing and spectrum 
auction remains unexplored. 

In this paper, we propose a spectrum allocation framework 
that takes both spectrum uncertainty and sensing inaccuracy 
into account. In particular, we consider two types spectrum 
resources: PU channels that are under the control of the 
operator, and the channels that the operator acquires from 
PUs out of its control. In practice, wireless service providers 
(WSP) act as operators, and they may cover areas that almost 
completely overlap. SUs registered with one of them may 
access spectrum from other WSPs as will be introduced in our 
model. In both types of channels, PU traffic on each channel 
is assumed to follow a known i.i.d. Bernoulli distribution. 



For the first type of channels, the real-time channel state can 
be learned accurately by the operator. For the second type 
of channels, a sense-before-use paradigm must be followed, 
where a collision with the PU traffic due to sensing inaccuracy 
incurs a penalty. 

Using a fixed set of channels of each type, we study 
the joint spectrum sensing and allocation problem to serve 
spectrum requests with arbitrary valuations and arbitrary levels 
of delay tolerance. The objective of the operator is to maximize 
social welfare, which equals to the valuations obtained from 
successfully served requests minus the cost due to collisions. 
We consider both the scenario where the operator knows all 
spectrum requests in advance, and the setting when spectrum 
requests are submitted online. While our online setting is 
similar to the online spectrum auction schemes considered 
in 0, fT3lL the key difference is that sensing inaccuracy is 
not considered in these existing works. Hence, the approaches 
in 0, lfT3l can only be applied to cases where accurate real 
time channel states are obtainable, which is not always the 
case. 

Our contributions can be summarized as follows: 

• We model the joint sensing and spectrum allocation prob- 
lem as a finite horizon Markov decision process when 
all spectrum requests are revealed to the operator offline, 
i.e., ahead of time. We develop an optimal dynamic 
programming based algorithm, which serves as a baseline 
for the achievable social welfare. 

• We propose a greedy algorithm for the case when spec- 
trum requests are submitted online. We prove that the 
online algorithm is 1/2-competitive for homogeneous 
channels, and we show that it achieves performance 
comparable to the offline algorithm for more general 
settings by numerical results. 

• We further extend the online algorithm by introducing 
a payment scheme to ensure incentive compatibility for 
SUs while guaranteeing a non-negative revenue for the 
operator. 

The paper is organized as follows: The system model 
and problem formulation are introduced in Section HH Our 
solutions to the problem with offline and online requests are 
presented in Sections |III1 and [IVl respectively. The online 
auction scheme for ensuring incentive compatibility for SUs 
and non-negative revenue for the operator is then discussed 
in Section [V] In Section [VII numerical results are presented 
to illustrate the performance of the greedy online algorithm 
in general cases, and the tradeoff between social welfare and 
revenue. We conclude the paper in Section IVII1 



II. System Model and Problem Formulation 
We consider a cognitive radio network with a single operator 
and multiple SUs registered with it (see Figure []])• The 
operator manages multiple orthogonal channels and controls 
the corresponding network composed of PUs. We focus on 
downlink transmission at the operator with power control. 
A time slotted system is considered with all PU and SU 
transmissions synchronized. All SUs are assumed to be in the 
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Fig. 1. System model of the CRN. In the left figure, small circles are SUs, 
squares represent PUs registered at the operator, and triangles are PUs out 
of the operator's control. The big circle is the coverage area of the operator. 
The right figure shows the availability of channels in T\ and T2. The big 
circle means the channel is busy and the small red circle means the channel 
is sensed idle. 

interference range of each other and that of PUs, and hence 
each channel can be assigned to at most one SU at any time 
when it is not used by PUs. 

The spectrum pool consists of two types of channels, those 
managed by the operator and those that are not. The operator is 
aware of the downlink activity of its own PUs at the beginning 
of each time slot. The set of the spectrum band£| managed by 
the operator is denoted by T\ . However, the activities of PUs 
not managed by the operator are unknown. Bands accessed 
by these PUs are denoted by T2. To access bands in T2, SUs 
cooperatively sense them and report their sensing results to 
the operator. The operator then makes a fusion decision on 
the activities of bands in T2 and selects a subset of channels 
sensed idle to serve the SUs. We only consider the set of 
PUs located in the coverage area of the operator so that all 
SUs in the system have the cognitive capability and can sense 
spectrum in T2. We assume that the sensing cost is low and 
even negligible. In practice, wireless service providers (WSP) 
act as operators, and they may cover areas that almost overlap. 
SUs registered with one of them may access spectrum from 
other WSPs as introduced in our model. 

We assume that the spectrum bands in T\ and T2 have the 
same capacity, which is normalized to 1. PU activities on these 
channels follow an i.i.d. Bernoulli distribution in each time 
slot. For instance, in Figure [T] there are three channels in T\ 
and two channels in T2. In time slot 1, channels 2 and 3 in 
T\ are idle and channel 1 in T2 is idle. However, channel 
1 in T2 is sensed busy and it will not be allocated. Also, 
channel 2 in T2 is incorrectly sensed to be idle and scheduling 
a request on this channel will lead to a collision. We let 
denote the probability that channel i in T\ is idle and 7T2 (j) the 
probability that channel j in T2 is idle. We also assume that 
the prior distribution of the PU activity is accurately acquired 
over time. We assume that state changes occur at the beginning 
of a time slot. Let C = \Ti \ + IT2I denote the total number 
of channels, which remains constant over time. Some of our 
technical results apply to the special case when all channels in 
T2 are homogenous, that is, when the channels have the same 
^2(2)' Pm{i) and Pf(i). Thus, they also have the same Po(i) 

! We use channel and spectrum band interchangeably. 



TABLE I 

Notation List 



Symbol 


Meaning 


M 


Set of spectrum requests submitted to the operator 


Ti 


Set of channels managed by the operator 


T 2 


Set of channels not managed by the operator 


7Tl(i) 


Probability that channel i in T\ is idle 


7T2(j) 


Probability that channel j in T 2 is idle 


C 


The total number of channels (C = |Ti | + |T 2 1) 


h(t) 


Availabilities of channels in T\ at t 




Availabilities of channels in T 2 at t 


m) 


Sensed availabilities of channels in T 2 at t 


p f (k) 


Probability of false alarm for channel k G T 2 


Pm(k) 


Probability of misdetection for channel k G T 2 


Pj(fc) 


Probability that channel /c is sensed idle 


Po(fc) 


Probability of channel k being idle given that it is sensed idle 




Earliest service time for request i 


di 


Deadline of request i 


Wi 


Valuation of the request i 


H 


The time period where spectrum allocation has to be made 


r 


Maximum number of outstanding requests in the system at any time 


Q 


Penalty price per collision 



and P/(z). 

The availabilities of channels in T\ and T 2 at t are denoted 
by binary vectors h(t) = (ll(t), • • • , if (t), • • • ) and T 2 (t) = 
(I 2 (£), • • • , l" 2 (t), ■ • • ), respectively, where represents idle 
and 1 represents busy states. Moreover, I 2 (t) denotes the 
sensed availabilities of channels in T2 at £. Let Pf(k), k G T2, 
denote the probability of false alarm for channel fc, i.e., 
the probability that SUs cooperatively sense channel k to be 
busy given that it is actually idle. Let P m (k) represent the 
probability of misdetection for channel k, i.e., the probability 
that SUs cooperatively sense channel k to be idle given that 
it is actually busy. We further define Pi(k) as the probability 
that channel k is sensed idle and Po(k) as the conditional 
probability of channel k being idle given that it is sensed idle. 
Note that Pj (ft) = 7r 2 (fc)(l - Pf(k)) + (1 - 7r 2 (fc))P m (fc) and 
P (fe) = n2{k)i p~^ fm . We assume that P f (k) and P m (fc) 
are constant for any channel fc G T 2 , which occurs e.g. when 
SUs are static in the system. 

We assume each spectrum request is for a single time- 
frequency chunk, i.e., a single time slot of any channel in 
T\ or T2. Each request i submitted at time t is of the form 
(ai, di,Wi), where ai > t is the required service starting time, 
di is the deadline, and Wi is the valuation of request i, which 
will be added to the social welfare if request i is served by di. 
We denote the set of requests by J\f = {1, • • • , N}. H = 
max iG Af<^ — mm ie x a i denotes the time period spectrum 
allocation needs to be made, and mm ie j\f ai is normalized 
to 1. The maximum number of outstanding requests in the 
system at any time is denoted as r. Table U summarizes the 
notations used in the paper. 

We are interested in maximizing the social welfare of the 
operator and the SUs in the system: When an SU submits 



a request, the operator will not make a commitment on the 
service; the valuation Wi is added to the social welfare if 
request i is served by di . The social welfare is defined as the 
total valuations from the requests served minus the collision 
cost to channels in T 2 . Let Q denote the penalty incurred per 
collision. Let xu(t) (i G J\T, I G T\ UT 2 , t = 1, • • • , H) denote 
the allocation indicator: xu (t) = 1 if request i is allocated to 
channel I at t\ xu(t) = otherwise, yi denotes the service 
indicator: yi = 1 if request i is served by df, yi = otherwise. 
The social welfare maximization problem is then formulated 
as follows, where Z(-) denotes the number of elements in 
a vector: 

Problem (A): 

H 

maxEi 1? i 25 i| -Q^E X^^^W 

x ' y ieJV ieAfieT 2 t=i 

di 

s .t. J2 ( E M*) + E a; *fc(*)( 1 - J 2 (*))) ^ d) 

t=a,i l£Ti keT 2 

for alii G TV 

^ ^ x u (t) < Z(h(t)\ for alU = 1, • • • ,H (2) 
5] E Xik(t) < Z(Ii(t)), for alU = 1, • • • , H, (3) 

iGA/"/eGT 2 

where x = (xu(t))i^u V = (Vi)ieM, Ii = (h (t))t=i,- ,h, 
I 2 = (I2(t))t=i,-,H, II = (If(£))t=i,...,tf. The cost 
QJ2ieM^ieT 2 EtLi^W^W takes int0 account the cur- 
rent availabilities of channels in T 2 . Inequality (HJ reflects the 
relationship between the allocation indicator xu(t) and the 
service indicator y im Inequality © guarantees that a channel 
in T\ will not be allocated unless it is observed idle. Likewise, 
Inequality © guarantees that a channel in T2 will not be 
allocated unless it is sensed idle. 

The challenges of Problem (A) are threefold: 1) The re- 
quests are uncertain since they may be submitted at different 
time slots; 2) Spectrum availabilities of T\ and T2 in the 
future are not known at the current time slot; 3) Sensing is not 
accurate for channels in T 2 . In the following, we propose an 
offline optimal solution in Section [Till and an online solution 
in Section [IV] We define the offline algorithm as an algorithm 
that decides the channel allocation for outstanding requests in 
each time slot with only the observed availabilities of channels 
in Ti and sensed availabilities in T2 of the current slot. All 
requests, including future arrivals, are assumed to be known. 
For instance, SUs submit their requests at the beginning of 
H. The operator then knows the full arrival information. In 
each time slot, the operator has to make channel allocation 
decisions based on the observed availabilities of its own 
channels and the sensed availabilities of channels managed by 
other operators. The only difference between online and offline 
algorithms is that online algorithm does not assume the full 
arrival information to be known ahead of time. Both algorithms 
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are designed under the challenges of spectrum uncertainty and 
sensing inaccuracy. 

III. Optimal Offline Algorithm 

In this section, we study Problem (A) under the assumption 
that the operator has full knowledge of the spectrum requests 
in advance. By our assumptions on channel statistics, the 
problem can be modeled as a finite horizon Markov Decision 
Process (MDP) fT2ll . In this section, we propose an optimal 
dynamic programming based solution to the problem. We start 
with the simple case where T\ — and all the channels 
for serving SUs are in T2, which models the case where all 
the channels owned by the operator are overloaded by PU 
traffic. Then, we proceed with the general case where both T\ 
and T2 channels are available for use in the system. In each 
time slot, based on the knowledge of the spectrum requests 
and the current channel state, the operator makes a joint 
decision including 1) which subset of requests to schedule; 
2) which subset of channels to allocate; 3) which request 
to assign to which channel. In our solution, we consider all 
possible scenarios for each time slot and find the schedule 
that maximizes the expected social welfare. Note that the 
social welfare is composed of two parts: Valuations of SUs 
that are served and the cost caused by collisions on the 
channels in T2 . We show that our algorithm has a complexity 



of 0(2 r 3 G (max {C,r}) 



min {C,r} 



HCr). When (a^,^) of re- 



quests do not have a dense overlap, i.e., r = 0(\ogN) where 
N is the total number of requests in [1, H], our algorithms are 
of polynomial complexity. 

We first define F(D,t) as the maximum expected social 
welfare from the beginning of slot t till the end of slot H 
given that the set of outstanding requests is D. The expectation 
takes into account all possible channel realizations and sensing 
results. We define F(D,H + 1) = for all D. Our goal 
is to calculate F(D, 1) where D = {i : i G A/", ai = 1} 
We calculate it backward from t 



(Algorithm IIII.lt . We calculate it backward from t = H till 
t = 1 is reached since requests requiring service in future 
time slots have an impact on the current optimal scheduling 
decision. Note that at any time t, it is sufficient to consider D 
in F(D,t)'s for being any subset of the requests that satisfy 
di < t < di. 

A. With no available channels in T\ 

When no channel is in T\ , the spectrum bands managed by 
the operator, SUs can only be served by channels in T2. SUs 
may request spectrum in arbitrary time slots. The success of 
serving request i contributes Wi to the social welfare while the 
assignment of a request to a busy channel causes collisions, 
incurring a penalty of Q. 

We define X(D,S,t) as the maximum expected social 
welfare from t (t = 1 , • • • , H ) to the end of the period, given 
that the set of outstanding requests is D and channels in S 
are sensed idle (S C T2). The expectation is taken over all 



possible realizations of 1 2 . Then 

X(D,S,t) = 



max 

x(t) 



e n p o(o n a-^M) 

s 1 csies 1 mes\Si 



(W(D,S,S 1 Mt),t)+F(D',t+l)) 



,(4) 



where W(D,S, S u x(t),t) 

= E [ Wn E x nk (t) 



Q £ 

neD keSx kes\S! 

is defined as the social welfare achieved in time slot t, for 
a given D, the set of outstanding requests; S C T2, the set 
of channels sensed idle; Si C S, the set of channels that are 
sensed idle and actually idle; and x(t), the channel allocation 
at t. Recall that x n k(t) is the allocation indicator used to 
determine whether the SU is served by this allocation. We 
form D' based on D as follows: If request m is allocated to 
channels in Si, then remove m from D, which means it is 
served and the request no long exists. If request n satisfies 
Q>n = t + 1, then add n to D, which indicates it is a new 
request. Among the remaining requests, those that expire at 
the beginning of t + 1 are removed from D. 

Based on X(D, S, t), we calculate F(D,t) as follows. The 
expectation in F(D, t) in the form of the product of Pi (I) and 
(1 — Pj(ra)) takes into account all realizations of I|. 

F(D,t)= EI1 P H0 II {l-Pi(rn))X(D,S,t), 

SCT 2 leS mer 2 \s 

(5) 



In Algorithm MI. 11 our objective F({i : i £ = 1}, 1) 

is calculated by dynamic programming. It first calculates the 
maximum social welfare and the corresponding schedule for 
each time slot, and then specifies the real time operations. 
Lines 1-6 calculate F(D,i) backward from H to 1 given 
the initial condition defined earlier F(D,H + 1) = for 
all D. Line 5 calculates the optimal scheduling policy for 
time t given D, the request set; S, the set of channels sensed 
idle; and Si, the set of channels sensed idle and actually idle, 
according to Equation ©. The value of F(D,i) is updated 
in Line 6 according to Equation ©. The complexity of the 
Equation © is 0(3^1 (max {|T 2 |, r }) min ^ T2 l' r >|T 2 |r): The 
number of possible channels realizations is 3' T2 I since different 
social welfare values will be generated in the cases where 
the channel is sensed idle but actually busy, it is sensed 
idle and actually idle, and all other cases. It takes at most 
(max{|T2|, r }) mm {l T 2l> r } combinations to find the optimal 
x in Equation ©. The complexity for the calculation of 
W(D,S,Sux(t),t is 0(\T 2 \r). On the other hand, given t, 
the number of possible argument combinations in F(D,i) 
is 0(2 r H) by assumption. The total time complexity is 
0(2 r 3 c (max{C,r}) min ^ C ' r >i^Cr). Note that C is assumed 
to be a constant in our model. 
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Algorithm III.l Dynamic Programming based Optimal Algorithm for Social Welfare Maximization 
Offline computation 
for t = H to 1 do 



for all D C {i : < t < di} do 
for all SCT 2 do 



X(D,S,t) <- max 

x(t) 



E EI fl)(0 EI (l-^o(m))(E w n J2x nk (t)-Q £ z nfc (*) + 

5iC5/G5i mG5\5i neD 1 keS ± fceS\Si J 

e n^(o n (i-P/(m))x(A5,t) 

SCT 2 leS meT 2 \S 



Real-time scheduling 

1: At each time slot t with a set of requests D that are currently in the system and a set of channels S that are sensed idle, 
allocate channels to the requests based on the schedule x(t) that maximizes X(D,S,t). 



B. With at least one channel in T\ 

With channels in Ti, requests can be served by channels 
in both T\ and T2. Since the channel availabilities of T\ are 
known at the beginning of each time slot, they can serve SU 
requests without any cost. Thus, once observed idle, channels 
in T\ could be assigned to requests so as to maximize the sum 
of valuations. Our focus is still the allocation of channels in 
T 2 if they are sensed idle. 

We define Y(D,S,t) as the maximum expected social 
welfare from t to the end of the period, given that the set 
of outstanding requests is D and channels in S are sensed 
idle (S C T2). We also define X(D, T, S, t) as the maximum 
expected social welfare from t to the end of the period, given 
that the set of outstanding requests is D, channels in T are 
observed to be idle (T C Ti), and channels in S are sensed idle 
(S C T2). The expectation in Y(D, S, t) is for all realizations 
of Ii and I2. The expectation in X(D,T, S,t) is for all 
realizations of 1 2 . Then, 



X(D,T,S,t) 



max 

x(t) 



e n p o(o n 



s 1 csies 1 



m6S\Si 



(W(D, T, S, 5i, x(t), t) + F(D', t + 1)) 



(6) 



where 



neD ler 



keS! 



1 E 

keS\S-. 



%nk (t) 



is defined as the social welfare achieved in time slot t, given 
D, the set of outstanding requests; T, the set of channels in 
T\ that are observed to be idle; S, the set of channels sensed 
idle; Si, the set of channels sensed idle and actually idle; 
and x(t), the channel allocation at t. The only difference 
between W(D, 5, Si, x(t), t) and W(D, T, 5, Si, x(t), t) is 
the addition of the valuations contributed by the service on 
channels in T±. We form D' based on D in a similar way 
to Equation ©: If a request m is allocated to channels in 



Si U T, then remove m from D, which means it is served and 
the request no long exists. If request n satisfies a n = t + 1, 
then add n into D, which indicates it is a new request. All 
other SU i are removed from D only when di = t + 1. We 
then calculate Y(D,S,t) as: 



Hence, 



Jtti(0 (l-7n(m))X(£>,r,5,t). 



rcTi ier 



meTi\r 



(7) 



W*)=EII^(0 II (i-P/M^A^t), 

scr 2 ies mer 2 \s 

(8) 

which takes into account all realizations of Ii, the availabilities 
of channels in Ti; I|, the sensed availabilities of channels in 
T2; and I2, the actual availabilities of channels in T2. 

The algorithm is similar to Algorithm IIII.ll except that 
F(D,t) is updated according to Equation ([8]). The complexity 
of Equation ® is 0(3l T2 l2l Tl l (max {|T 2 |, r }) min ^ T2 l' r >Cr). 
The difference from the complexity of Equation © lies 
in 2l Tl l, which is caused by the number of T\ chan- 
nel realizations. Following a similar argument as in the 
case where |Ti| = 0, the total time complexity is still 
0(2 r 3 c (max{C,r}) min ^ C ' r >i^Cr). Note that, for homoge- 
neous channels in T2, the allocation policy becomes simpler 
since allocation to different channels in T2 makes no differ- 
ence. Then, we can replace (max{C, r}) mm ^ C ' r ^ with r in 
total complexity, resulting in a complexity of 0(2 r 3 c HCr 2 ). 

C. Discussion 

In this section, we prove some structural properties of 
the optimal solution, which helps to further reduce the time 
complexity of the algorithm and also provides insight to the 
design of the online algorithm discussed in Section [IVJ Note 
that at any time t, for an active request i and a channel 
k e T 2 that is sensed idle, P (k)wi - Q(l - Po(k)) is the 
expected immediate social welfare contributed by request i if 
i is assigned to k in the current slot. Proposition IIII.ll shows 
that a non-negative immediate social welfare is necessary for 
request i to be served by channel k in the optimal solution, 
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which turns out to be a sufficient condition in certain scenario 
as stated in Proposition MI. 21 as well. 

Proposition III.l. At any time t, if a request i is scheduled 
on channel k G T2 in Algorithm then Po(k)wi > Q(l — 
Po(k)). 



Proof: We will prove Proposition MI. II for the case 
without T\ channels. It can be shown for the general case 
in a similar way. Suppose at time t, request i is assigned to 
channel k in the optimal solution, with the system state being 
(D,S,Si) as defined before. Note that k may or may not be in 
Si. Let x(t) be the optimal schedule, Ds be the set of requests 
scheduled in x(t), and D' s s be the set of outstanding requests 
for t+1 with Ds scheduled in t. Let x(t) be the same schedule 
as x(t) except that i is excluded. To simplify the notation, let 

R(S, Si) = n p o(0 EI (1 - Po(m)). Then we have 

leSi mes\S! 



X(D,S,t) 



E ii(5,5i)(W(A5,5i,x(t),t) 

s 1 cs,kes 1 



+Wi + F(D' SfSl ,t + l)) 



E R(S,Si)(W(D,S,Si,m,t) 

S 1 CS,k(£S 1 



Q + F(D' SiSi ,t + l)) 



(9) 



On the other hand, if i is not scheduled, then the expected 
social welfare from t to H is 



X'(D,S,t) 



JJ(5,5i)(W(£>,5,5i,i(t),t) 



SiCS.feeSi 



E ii(5,5i)(W(A5,5i,x(t),t) 

L5iC5,fc^5i 



(10) 



+ KP (A:)-Q(1 



Po(fc))) E R (S>Si)>°> ( n ) 

5iC5\{/c} 



where F(D^ Si , £ + 1) - F(D^ Si U {i}, £ + 1) < since the 
social welfare is monotonic over the set of requests. Hence, 
we must have Po(k)wi > Q(l — Po(k)) for Inequality (TTTb to 
hold. ■ 
Proposition MI. 21 shows that the condition Po(k)wi > Q(l — 
Po(k)) is also sufficient for a request to be scheduled for 
homogenous channels. To simplify notation, we drop the index 
for channel related parameters for the homogeneous case. 

Proposition III.2. In a system with no channels in 7\ and 
homogeneous channels in T%, if there exists at least one request 
i that satisfies P^Wi > Q(l — Po) in a slot t and there is at 
least one channel sensed idle, then in the optimal solution 
at least one of the requests satisfying this condition will be 
scheduled, for all t. 

Proof: Given the system state (D,S,Si) at time t, 
consider a subset of requests Ds C D to be scheduled where 
i G Ds. Let D' s s denote the set of the outstanding requests 
at t + 1 given that Ds is scheduled at t, and k the channel as- 
signed to i in the schedule. We want to show that the expected 
social welfare from t to the end of the time period with Ds 
scheduled at t is at least as large as that with Ds sched- 
uled. We define U(T 2 ,S) = U Pi(l) U (1 - Pj(ra)). 

les meT 2 \s 
We also define Fi(D,t) as the expected social welfare from 
t till the end of H by scheduling Ds at time t and P2 (£>,£) 
as the expected social welfare from t till the end of H by 
scheduling Ds \ {i} in time t. By Equations ©, (fTOl) , and 
(TTTb we obtain 



F 1 (D,t)-F 2 (D,t) = U(T 2 ,S) 

SCT 2 



\Si\ 



(1 _ Po) |SVn| 



ls 1 cs,kes 1 



<WiP - Q(l - P )) E P o Sl1 (! - p o) 

SiCS\{k} 



E U F*> ^ [ P o(*W, Sl ,*+!)" U {*}, £ + 1)) 

5CT 2 

KPo-Q(l-Po))] E ^'(l-^o) 1 ^ 51 '- 1 (12) 



5iCS\{fc} 



In the following we will show that Pq(F(D' s s U {i}, £ + 
!) - F ( D s,Si^ + !)) ^ ^ P o - Q(l - Po). We first 
observe that F(D,t) < P(P \ + P({<M) for 

any £, P and z G D since z is competing with requests 
in D \ {i} for the spectrum in the former case but not 
in the latter case, in Hence we only need to prove that 
Po(F({i},t + 1) - P(0,t + 1)) < Wi Po - Q(l - Po) 
for all t, that is, P F({i},t + 1) < ^P - Q(l - P ) 

E ii(5,5i)(P(i5^ 5l ,t + l)-P(i5^ 5l U{0,* + l))f or a11 * since + X ) = °- We wil1 P rove [t h y 

s 1 cs,kes 1 induction. We define Pj as the probability that at least 



Since is the optimal solution for Equation (|4]), we have 
(l9l)-([TQl)> 0. By rearranging the terms, we obtain 

J2 R(S, Si)(F(D> s , Si , t + 1) - F(D' S}Si U {*}, t + 1)) 
s 1 cs,kes 1 

+wi E R(S : Si)-Q E R&S!) 
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one channel is sensed idle. We start with t = H and then 

P F({i},H) = PoPiiwiPo-Qil-Po)) < WiP -Q(l-P ). 
Suppose P F({i},t) < WiP - Q(l - P ) for all 
t > r, then F({i},r) = (1 - Pi)F({i},r + 1) + 
Pi max {w z P + (1 - Po)(F({i}, r + 1) - Q), r + 1)} 

We calculate 

r + 1) - ( Wi Po + (1 - ^(^(W, r + 1) - Q)) 

= PoC^CW, r + 1) - (^Po - Q(l - P ))) < 0, 

where (a) is by the induction assumption. Then we know i 
should be scheduled in the optimal solution at r if it is the 
only request. Hence, 

PoF({i},r) = P ((l - P/)P(W,r + 1) 

+P/(wiPo + (1 - Po)(P(W, r + 1) - Q))) 

= (1 - P Pj)PoP({i}> t + 1) + PoPi(w z P - Q(l - P ))) 

(6) 

< ^Po-Q(l-Po)), 

where (b) is by the induction assumption. Thus Pq(F(D U 
- F{D,t)) < WiP - Q(l - P ) for all L> and t. 
Therefore, Equation ([T2l)> 0, which means the expected social 
welfare from t to the end of the time period with Ds scheduled 
at t is always better than that with Ds scheduled. ■ 
Based on these propositions, we can reduce the candidate 
set of requests for scheduling in each time slot. For instance, 
no requests should be scheduled if Po(k)wi < Q{\ — Po(k)) 
for all existing requests i and all k sensed idle. Also, in a 
system with no channels in T\ and homogeneous channels in 
T2, the candidate set is composed of all requests that satisfy 
Po(k)wi > Q{1 — Po(fc)). We utilize these propositions in the 
design of our online algorithm. 

IV. Online Algorithm 

In this section, we introduce a greedy online algorithm 
(Algorithm lIV.ll) that does not need future arrival information. 
For systems where requests are not submitted ahead of the 
required service starting time a$, the online algorithm makes 
decisions based on the information available in the current 
slot. 

In Algorithm IIVU the main idea is to (greedily) offer 
requests with higher valuation channels with better quality. 
We define c k = Q{\ — Po(k))/Po(k), which is the expected 
cost of serving one request on channel k (will be shown in 
Lemma HvB . Note that c k = for k G T t . Lines 3 and 4 
sort channels sensed idle by c& and current requests by Wj, 
respectively. Since accessing channels in T\ causes no cost if 
observed idle, they are allocated first to requests with highest 
valuations (Lines 6-11). In Lines 15-22, the remaining requests 
are allocated to channels in T2 sensed idle from highest val- 
uation to lowest if they satisfy w n > 6(k) where 6(k) serves 
as a reservation price for using channel k. We set 6{k) = c k 
in this section, which is motivated by Propositions MI. II and 
MI. 21 Allowing different values of reservation price provides a 



Algorithm IV. 1 Greedy Online Algorithm 
In each time slot t: 

l: if D = then 

2: exit 

3: Sort channels in S (channels sensed idle in T2) by c& in 

ascending order 

4: Sort requests in D (outstanding ones) by Wj in descending 

order 

5: i <- 1 

6: for all I in T (channels in T\ observed idle) do 

7: Xu(t)<-1 

8: D <- D \ {i} 

9: if D = then 

10: break 

11: i <- i + 1 

12: if D = then 

13: exit 

14: n <- |r| + 1 

15: for all k in S (channels sensed idle in T2) do 

16: if w n < 0(k) then 

17: break 

18: X nk <r- 1 

19: D 4r- D\ {n} 

20: if D = then 

21: break 

22: n <— n + 1 



way for trading off the social welfare and the revenue of the 
operator, which will be discussed in detail in Section IV1 

The time complexity of Algorithm II V. 1 1 is 0(C log C + 
rlogr) since the complexity of sorting in Lines 3 and 4 
dominates that of allocation in Lines 5-22. We then study the 
performance of the online algorithm. An online algorithm for 
a maximization problem is c-competitive (c < 1) if it achieves 
at least a fraction c of the objective value of an optimal offline 
algorithm for any finite input sequence Q. We show that the 
greedy online algorithm is 1/2-competitive when |Ti| = 
and channels in T2 are homogeneous in Proposition lIV.il For 
heterogenous channels, we will show that the online algo- 
rithm achieves performance comparable to the optimal offline 
algorithm by numerical results in Section [VH To establish 
Proposition IIVU we first show that c k is the expected cost 
per a request served by channel k in Lemma HV.ll 

Lemma IV. 1. For any scheduling policy, c k is the expected 
cost of serving a request on channel k. 

Proof: For any scheduling policy, consider the time 
interval right after a request is served by channel k and before 
the next request is served by channel k. Remove all time slots 
in the interval when there are no requests in the system or 
channel k is sensed but not allocated. Given that a channel is 
sensed idle, the probability that collision happens is 1 — Po(k). 
Thus the number of slots where collisions happen follows 
a geometric distribution and the expected cost per a request 
service on channel k is Q{\ — Po(k))/Po(k). ■ 
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Based on Lemma IIV.U we show the competitive ratio of 
Algorithm II V. 1 1 in special cases. 

Proposition IV. 1. If \T\\ = 0, and channels in T2 are 
homogeneous, then Algorithm \IV1\ is 1/2-competitive. 

Proof: Let the random variable 7 denote the set of 
requests that are eventually served by the algorithm. Let 
Po = Po(k) for any channel k G T2. Since the channels in 
T2 are homogeneous, we have c = Q(l — Pq)/Pq, which is 
the expected cost for serving a single request by Lemma fTvTI 
Then the expected social welfare can be written as follows: 

N 

E[(E pr(7)E^)- fccPr (N = fc ). 

k=l \"y\—k i£7 
N 

= E[E Pr ^E(^- c ). 

k=l \~f\=k ^€7 

Note that the greedy algorithm always chooses the active 
request with highest valuation. For any sample path, consider 
the set of requests served by the optimal offline algorithm 
and those by the greedy algorithm with w [ = Wi — c as the 
valuation. We follow the same argument as in 0: We consider 
any request i that is scheduled offline but not online. Since 
request i is not scheduled online, it is present at time t and 
the greedy algorithm schedules another request j in that slot, 
the valuation of request j should be as least as large as that of 
request i. For any request i that is allocated offline and also 
online, it makes the same contribution to the social welfare. 
Then the offline solution achieves a social welfare at most 
twice that in the online solution since , , j ? > ^ . Therefore, 

w-\-w- — 2 ' 
I 1 1 3 

Algorithm II V. 1 1 is 1/2-competitive. ■ 
Note that the factor 2 in Proposition II V. 1 1 does not depend 
on request arrival patterns or channel related parameters. 
Algorithm II V. 1 1 can always achieve at least \ of the social 
welfare of the optimal offline algorithm (Algorithm lIII.il) when 
the system is only composed of homogeneous T 2 channels. 

V. Achieving Incentive Compatibility 

In this section, we design an online auction scheme which 
utilizes the online greedy algorithm (Algorithm lIV.ll) together 
with a payment scheme to achieve incentive compatibility for 
SUs. Due to the collision penalty, however, a social welfare 
optimal auction may end up with a negative revenue for 
the operator, which is not reasonable since the operator may 
choose not to start the auction in the first place. We introduce 
a reservation price for resolving this problem, which also 
provides a way of trading off the social welfare and revenue. 

A. Incentive Compatibility for SUs 

When the available spectrum resource cannot satisfy all the 
requests, which is often the case, a selfish SU may choose to 
cheat on its valuation or arrival and deadline times to obtain 
some priority of being served. Such strategic behavior leads 
to a less efficient system. In this section, an online auction 
scheme is presented (see Auction 1) to suppress the cheating 



behavior. At any time slot t, the operator accepts bids of the 
form (di^di^Wi), where di = t and di denote the reported 
required service starting time and the deadline, respectively, 
and Wi denotes the reported valuation. All these values could 
be different from the true values of request i. We assume there 
is no early-arrival misreport and late-departure misreport in the 
system, that is, di > ai and di < di in any bid. In practice, 
both of them can easily be detected since the request is no 
longer in the system when either misreport occurs. 

Let pi denote the payment that the operator charges a SU 
for having its request i served. The net utility for request i is 
defined as: Ui = Wi — Pi if request i is served and Ui = 
if not. A mechanism is said to be dominant- strategy incentive 
compatible (DSIC) if for any given sample path of channel 
state realizations and sensing realizations and a set of requests, 
each request maximizes its utility when it truthfully reveals the 
private information independent of the bids from other requests 
(adapted from Definition 16.5 in Lllj). In Auction 1, for every 
request that is successfully served by its deadline, a critical 
price is charged, which is defined as the maximum reported 
valuation under which it will not be served assuming the other 
bids are fixed. 

Auction 1: Requests (di^di^Wi) are reported to the operator 
at time t = di. 

(i) At the beginning of each t, allocate requests according to 
Algorithm UVU 

(ii) Every request successfully served pays its critical price, 
collected at its reported deadline. 

Proposition V.l. Auction 1 is DSIC with no early-arrival and 
no late -departure misreports. 

Proof: According to Theorem 16.13 in ifTTIl . to show that 
Auction 1 is DISC, it is sufficient to show that the mechanism 
is monotonic in terms of both valuation and timing. That is, 
for a given sample path of channel realizations and sensing 
realizations and a set of requests, if request i submitting a bid 
(cbi, di,vbi) wins, then it continues to win if it instead submits a 
bid (a-, d'i,w'i) with w[ > w i9 a- < di, and d[ > di, assuming 
other bids are fixed. This condition can be easily verified. So, 
Auction 1 is DSIC. ■ 
By the definition of critical price, we propose Algorithm lV.il 
that applies binary search to find the critical price for requests 
scheduled by Algorithm lIV.ll Algorithm IV. 1 1 runs in each slot 
t when there are requests scheduled. In the binary search from 
Lines 6-12, scheduling decisions must be remade from t = di 
to di with Wi updated by the new value of Wi (Line 7) till the 
critical price is found. 

B. Non-negative Revenue for the Operator 

Our objective in Problem (A) is to maximize the social 
welfare without considering the revenue at the operator side. 
However, for an actual business model to be viable, it is 
important that the revenue of the operator is taken into account. 
The revenue is composed of two parts: Payments collected 
from the SUs by serving their requests and the penalty paid 
for causing collisions. Using Q(l — Po(k))/Po(k) as Q(k) 
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Algorithm V.l Critical Price Calculation for Requests 
In each time slot t: 
1: for alH e D do 

2: if i is scheduled by Algorithm II V. 1 1 then 



3: W iow <- 

4: W/m^ <- 

5: while w iow < w high do 

7: Run Algorithm IIV.1I with the valuation of request 

i updated by cpi from t = ai to <^ 

8: if i is scheduled then 

9: Wfci^ <- cpi 

10: else 

11: Wiow <~ cPi 

12: Output cpi as the critical price for i 



in Algorithm IIV.U the operator may get a negative revenue, 
which means the sum of payments by SUs does not exceed 
the penalty paid to PUs out of its control. To overcome 
this problem, we introduce a reservation price q, which is 
a constant for fixed channel related parameters. We further 
reset the value of 0{k) to be q in Algorithm IIV.1I and apply 
Auction 1. 

We interpret the reservation price as the expected cost 
associated with each request served by channels in T^. Hence, 
the revenue of the operator will always be non-negative if at 
least the reservation price is paid by the SU for getting its 
request served. Next, we show the form of the reservation 
price for the special case in Proposition IV. 21 

Proposition V.2. If\T\\ =0 and channels in T2 are homoge- 
neous, then a reservation price qo = Q(l — Pq)/Pq leads to 
non-negative revenue for the operator. 

Proof: The revenue of the operator is expected payment 
collected from SUs by serving their requests minus the cost. 
The payment of SUs is at least the reservation price Q(l — 
Pq)/Pq. By Lemma llVJI we know that the expected cost is 
Q(l — Pq)/Pq as well. Hence, the expected revenue is non- 
negative. ■ 
When there are heterogeneous channels, we try to find the 
average expected cost of serving a request and set it as the 
reservation price to guarantee a non-negative revenue. Recall 
that Ck is the expected cost of serving a request on channel h 
by Lemma IIV.U Assume that channels have been sorted by a 
non-decreasing order of c k . For ease of illustration, we index 
channels in T\ from 1 to \T\ |, and channels in T2 from \T\ | + 1 
to I Ti| + |^2 1 - Let n'j denote the expected fraction of requests 
served by channel j for a given set of requests. Then, the 

average expected cost per request is q' = Y c j n j- We 

JGT1UT2 

would like to have an estimate of q' that is independent of the 
request set. To this end, let v k denote the probability that the 
channel k is viewed as idle and it is really idle. That is, v k = 

7ri(fc) if k £ Ti, and v k = Pj(k)P (k) if k £ T 2 . Let m k = 



— ^j 2 — — . We use m k as the estimated fraction of requests 

1<ET 1 UT 2 

served by channel k. Then we let q\ = Y c j m j^ which 

jET 1 UT 2 

we use to estimate the average expected cost per request and 
set it as the reservation price. We would like to show that q\ > 
q' and hence using q\ as a reservation price, a non-negative 
expected revenue is obtained. We start with Lemma IV. II that 
provides a sufficient condition for q\ > q' . 

Lemma V.l. If < for all j, then q\ > q' . 

J m j+1 — n' j+1 J J ' ^1—^ 

Proof: We calculate q\ — q' = c j( m j ~ n 'j)- ^ n 

jeTtUT 2 

the following, we will show that there exists i such that for 
all j < i we have rrij < n' and for all k > i we have 

mk > n' h . Since ™ 3 < — r 2 - for all j, it is easy to see that: 

if rrij > n'j, then rrik > n' k by multiplying • • • anc ^ 

—^7—- • • • 7 fc , respectively, on both sides. Then we can find 

n j n k-i 

such i. We divide q\ — q' into two parts: 

qi - q' = y^Cj(mj - n'j) + ^c k (m k - n' k ) (13) 

j<i k>i 

If i = |Ti| + |T 2 |, qi-q'= E - n'j) > 

jET 1 UT 2 

( max Cj)( Y] rrin — Y] n'A = ( max c 7 -)(l — 

1) = 0. If i = 0, then all terms in q\ — q' are positive. Next 
we consider the case where neither sums in Equation (TT31) 
has no terms. Since all terms in the first term in the sum are 
non-positive and all terms in the second term in the sum are 
positive, we obtain 

qi-q' > (maxcj) V" (rrij - n'A + (mmc k ) V" (m k - n' k ) 

j<i * — ■ k>i — ■ 

j<i k>i 

> (rnaxc^) ^ ( m i ~ n 'j) + ( mfc ~ 

j<i k>i 

where (a) is by the assumption that c\ < ••• < C| Tl | + | T2 |. 
Hence q\ > q' holds. ■ 
Based on Lemma IV. II we claim that a reservation price of 
q\ results in a non-negative revenue for the operator. 

Proposition V.3. The operator achieves a non-negative ex- 
pected revenue with reservation price q\ when H is large 
enough. 

Proof: It suffices to show that q\ > q' . Consider any 
sample path. Without loss of generality, consider the first two 
channels in the sorted list. Let n\ and n2 denote the number 
of requests served by channels 1 and 2, respectively. Let Si 
denote the number of time slots that are sensed in the interval 
between (i — l)th and i-th requests served by channel 1, and 
define bi similarly for channel 2. Let A denote the total number 
of time slots between and H that are not sensed for channel 
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1 (because there is not enough requests or there is no benefit 
to sense it), and A' the number of time slots after the last 
request is served by channel 1. Define B and B' similarly 
for channel 2. Note that by the ordering of channels, when 
there is only one request in the system, and both channels are 
available, channel 1 will be used. It follows that A < B. We 
then have # = £ ^ + 4 + A' = + £ + Therefore, 
H = £Q>* +A + A') = E(n 1 )/m 1 + E(A) + E(A') 
(by geometric distribution) and H = E(ri2)/m2 + E(B) + 
E(B'). Note that E(A')/H -+ and E{B')/H -+ when 
H -+ oo. Therefore 4 = f£4 = [ff'fffil™ 1 > m since 

E{n 2 ) [H—E{B)\m 2 — m 2 

H - E(A) > H — E(B). It then follows that q x > q' by 
Lemma IV. II Hence, a reservation price of q\ leads to a non- 
negative revenue at the operator. ■ 
Remark: With a lower reservation price, the online algo- 
rithm tends to access the channels more aggressively, thus, the 
revenue of operator is harmed. On the other hand, a higher 
reservation price prevents more requests from accessing the 
channels, which harms the social welfare and further affects 
the revenue, as well. We will evaluate the tradeoff between 
social welfare and revenue by setting different reservation 
prices in Section PvTI 

VI. Numerical Result 

In this section, we evaluate the performance of the greedy 
online algorithm (Algorithm II V. 1 b and the tradeoff between 
social welfare and revenue for different reservation prices. We 
first show the competitive ratio of the greedy online algorithm 
with different channel settings and request related parameters, 
respectively. We then apply Auction 1 with varying reservation 
prices and show the performances of social welfare and 
revenue. 

A. Simulation Setting 

We let the arrivals ai of requests follow a Poisson dis- 
tribution and the duration di — ai of the requests follow 
an exponential distribution. The valuations follow a uniform 
distribution in [1,15]. We choose Q = 10, the penalty per 
collision, comparable to the valuations in all our simulations. 
We fix the number of requests as 20, and the inter-arrival 
mean as 3 slots, and vary the mean of request duration to 
adjust the density of requests. Given the means of inter-arrival 
and request durations, we generate 50 groups of requests 
and compare the average for the metrics we consider. We 
generate the channel availabilities in each time slot based 
on our assumption that channel states follow i.i.d Bernoulli 
distribution and 100 samples of channel realizations are taken 
for our simulations. The channel parameters we use will be 
introduced in Section IVI-BI 

B. Performance of Greedy Online Algorithm 

In Figure |2(a)[ we compare the performance of Algo- 
rithm IIV.1I with that of Algorithm MI. II when there are three 
homogeneous T 2 channels in the system with 7r 2 = 0.6324, 
Pm = 0.2218, P f = 0.6595 and various number of T x 
channels. The ?/-axis denotes the achieved competitive ratio, 




Request duration Request duration 



(a) Homogeneous T2 channels. (b) Heterogeneous T2 channels. 

Fig. 2. Performance of online algorithm versus offline algorithm over various 
request duration means with homogeneous and heterogeneous T2 channels 
(IT2I = 3), respectively. 

i.e., the ratio between the social welfare of the online algorithm 
and that of the optimal offline algorithm. When \T\ \ = 1, we 
set TTi = 0.5058; When \T X \ = 2, we set tti(1) = 0.8147 
and 7Ti(2) = 0.1270. We observe that the performance of 
Algorithm IIV. 1 1 degrades as |Ti| increases, independent of the 
request duration mean. With a high number of T\ channels, 
a wrong decision made by the greedy online algorithm to 
schedule a request affects the performance more. Also, the 
greedy online algorithm serves requests of a larger density 
better than requests of a smaller density. When the system is 
overloaded with requests, even the optimal offline algorithm 
can not satisfy all requests. Thus, those with larger valuations 
tend to be chosen, as in the greedy online algorithm. All ratios 
plotted are strictly above |, even for those with |Ti| 7^ 0. 

In Figure |2(b)| we evaluate the performance of Algo- 
rithm [IVI] with heterogeneous T 2 channels. We use the same 
T\ channel parameters as in the homogeneous case. The 
parameters related to T 2 channels are as follows: 7r 2 (l) = 
0.9134, tt 2 (2) = 0.6324, tt 2 (3) = 0.0975, P m (l) = 0.1419, 
P f (l) = 0.7922, P m (2) = 0.2218, P f (2) = 0.6595, 
P m (3) = 0.6557, P/(3) = 0.2157. We observe similar results 
as in Figure |2(a)| Algorithm IIV. II performs better with fewer 
Ti channels and denser requests. Again all ratios are above ~. 

C. Tradeoff between Social Welfare and Revenue 

We now study the tradeoff between social welfare and 
revenue generated by Auction 1. In Figure [3j we vary the 
values of reservation price. Note that it is now a constant 
over requests given the channel related parameters, which are 
the same as in Section IVI-BI We first show the tradeoff in 
a system with homogeneous T 2 channels and no T\ channels 
in Figure |3(a)[ Both social welfare and revenue first increase 
and then decrease as the reservation price increases. At a 
low reservation price (< 3), the payment collected cannot 
recover the expected cost and hence the average revenue 
becomes negative. A low reservation price may also hurt 
social welfare by our necessary condition for serving requests 
(Proposition MI. It . On the other hand, when the reservation 
price is too high, fewer requests will be accepted, which hurts 
both social welfare and revenue. We note that when the 
reservation price is go = Q(l — Po)/Po = 3.8, a non-negative 
revenue is obtained, which is consistent with Proposition IV. 2 1 
At a very high reservation price (> 8), the social welfare and 
the revenue converge, where the payment actually becomes 
the same as the valuation for requests served. Note that the 
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Reservation price Reservation price 

(a) Homogeneous T 2 channels (b) Heterogeneous T 2 channels 
with |Ti| = 0. with |Ti| = 1. 

Fig. 3. Tradeoff between social welfare and revenue over reservation price 
with homogeneous and heterogeneous T 2 channels (|T 2 | = 3), respectively. 

revenue never exceeds the social welfare by the definition of 
critical price. 

In Figure |3(b)| we show the tradeoff in a system with T\ 
and heterogeneous T2 channels. The trend of social welfare 
and revenue is similar to that in Figure |3(a)| Note that with 
the reservation price q\ = 6.9 (defined in Section [V]), the 
revenue obtained is right above 0, which is consistent with 
Proposition IV. 3 1 and also shows that q\ is nearly a tight upper 
bound of the expected cost for this case. 

VII. Conclusion 

In this paper, we study the joint sensing and spectrum 
allocation problem for serving secondary users in cognitive 
radio networks with the objective of maximizing the social 
welfare. Our problem formulation takes into account both 
spectrum uncertainty and sensing inaccuracy, which enables 
dynamic spectrum access at small time scales. Using only 
channel statistics and real time channel states, we develop an 
optimal solution for serving a given set of spectrum requests 
with various time elasticity. We further propose an online 
algorithm, which does not require future information on the 
arrival process, and achieves a comparable performance as 
the offline algorithm. In addition, we show that the online 
algorithm together with a payment scheme achieves incentive 
compatibility for the SUs and a non-negative revenue for the 
operator. There are several open problems to be solved. First, 
in practice, a more flexible form of spectrum requests will be 
desirable. For instance, a request may ask for multiple chunks 
that may or may not be preemptive. Extending the current 
offline and online algorithms to this more general setting 
will be part of our future work. Second, we plan to extend 
the problem formulation by including the notion of spatial 
spectrum reuse in addition to the time dimension considered 
in the paper. Third, we plan to relax the assumption on the i.i.d 
Bernoulli channels by considering correlated channels, which 
involves solving an exploration vs. exploitation problem in the 
context of an auction. 
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